Visual Features for Scene Recognition and Reorientation

Author

  • Krista Anne Ehinger
Abstract

In this thesis, I investigate how scenes are represented by the human visual system and how observers use visual information to reorient themselves within a space. Scenes, like objects, are three-dimensional spaces that are experienced through two-dimensional views and must be recognized from many different angles. Just as people show a preference for canonical views of objects, which best show the object's surfaces and shape, people also show a preference for canonical views of scenes, which show as much of the surrounding scene layout as possible. Unlike objects, scenes are spaces that envelop the observer, and thus a large portion of scene processing must take place in peripheral vision. People are able to perform many scene perception tasks, such as determining whether a scene contains an animal, quickly and easily in peripheral vision. This is somewhat surprising because many perceptual tasks with simpler stimuli, such as spotting a randomly rotated T among randomly rotated Ls, are not easily performed in the periphery and seem to require focal attention. However, a statistical summary model of peripheral vision, which assumes that the visual system sees a crowded, texture-like representation of the world in the periphery, predicts human performance on scene perception tasks as well as on peripheral tasks with letter stimuli. This peripheral visual representation of a scene may actually be critical for an observer to understand the spatial geometry of their environment. People's ability to reorient by the shape of an environment is impaired when they explore the space with central vision alone, but not when they explore the space with only peripheral vision. This result suggests that peripheral vision is well designed for navigation: the representation in peripheral vision is compressed, but this compression preserves the scene layout information that is needed for understanding the three-dimensional geometry of a space.

Thesis Supervisor: Ruth Rosenholtz, PhD
Title: Principal Investigator

Similar resources

Active, passive and snapshot exploration in a virtual environment: influence on scene memory, reorientation and path memory.

We investigated the importance of active, passive and snapshot exploration on spatial memory in a virtual city. The exploration consisted in traveling along a series of streets. 'Active exploration' was performed by giving directions to the subject who controlled his displacement with a joystick. During 'passive' exploration, the travel was imposed by the computer. Finally, during 'snapshot exp...

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

Scene and Object Recognition with Supervised Nonlinear Neighborhood Embedding

Image category recognition is important to access visual information on the level of objects and scene types. In this paper, we develop a Supervised Nonlinear Neighborhood Embedding (SNNE) subspace algorithm of different visual features for object and scene recognition, which learns an adaptive nonlinear subspace by preserving the neighborhood structure of the visual feature space. In the propo...

Aircraft Visual Identification by Neural Networks

In the present paper, an efficient method for three dimensional aircraft pattern recognition is introduced. In this method, a set of simple area based features extracted from silhouette of aerial vehicles are used to recognize an aircraft type from its optical or infrared images taken by a CCD camera or a FLIR sensor. These images can be taken from any direction and distance relative to the fly...

Basic level scene understanding: from labels to structure and beyond

An early goal of computer vision was to build a system that could automatically understand a 3D scene just by looking. This requires not only the ability to extract 3D information from image information alone, but also to handle the large variety of different environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the SUN data...

Publication date: 2013